Spoken word controlled automatic dialer

ABSTRACT

A speech controlled dialing circuit identifies input utterances which may be a command word (mode select), repertory word (dialing name or number), or nonrecognized (&#34;Other&#34;). Responsive to the identification of each occurring input utterance, a set of predetermined templates are selected to identify the next occuring utterance. A programmed microprocessor system is described to implement the main controller function.

TECHNICAL FIELD

Our invention relates to automatic dialing in communication systems andmore particularly to speech controlled automatic dialing arrangements.

BACKGROUND OF THE INVENTION

Automatic and repertory dialing arrangements permit telephone systemsubscribers to access frequently called telephone numbers without timeconsuming and errorprone manual dialing. Such dialers find widespreaduse in the business environment where efficient utilization of telephonecommunication is economically important. Automatic and repertory dialingalso permits handicapped persons who find it difficult or impossible toperform the manipulations required for manual dialing to use telephonecommunication without assistance.

While the number of manual operations needed to complete the dialing ofa telephone number is significantly reduced in known repertory dialingsystems, manual operations have not been eliminated. Some manipulationof the dialing apparatus is required for each call and additional manualprocessing is needed to insert or change the repertoire in the dialingdevice. It is desirable to design automatic and repertory dialers sothat manual operations are completely eliminated. Speech recognitionapparatus has been utilized to insert information into data processingequipment and to control mechanical devices without direct physicalcontact. Prior art speech recognizers, however, have been able tooperate only with very restricted vocabularies.

U.S. Pat. No. 3,742,143 issued to M. Awipi on June 26, 1973 and assignedto the same assignee discloses a limited vocabulary speech recognitioncircuit for machine and telephone control in which an utterance isrecognized as one of a limited set of vocabulary words. Detection ofpredetermined words may initiate telephone operations such as dialing.Awipi, however, requires each utterance to be recognized as one of theentire set of stored reference words. Any increase in the size of thestored reference words to incorporate an extensive list of names oradditional commands substantially increases the difficulty of automaticrecognition and reduces the accuracy of utterance recognition.

U.S. Pat. No. 4,164,025 issued to John J. Dubnowski and Aaron E.Rosenberg Aug. 7, 1979 and assigned to the same assignee discloses aspoken letter recognition arrangement designed to provide automaticdirectory assistance to subscribers over a telephone connection. Thisarrangement, however, requires spelled input in order to obtain a highdegree of recognition accuracy. Other speech recognizers are similarlyrestricted in vocabulary for accurate recognition. The article"Computers that Talk and Listen: Man-Machine Communication by Voice" byJ. L. Flanagan, Proceedings of the IEEE, Vol. 64, No. 4, April 1976, pp.405-415 describes a recognition technique in which information is inputto a computer via a man-machine dialog. A predetermined sequence ofsynthesized messages are generated. For each message, the recognitionvocabulary is restricted to an answer ensemble appropriate to thesynthesized message inquiry. These and other prior art recognizers arenot adapted to automatic and repertory dialing wherein changing dialervocabulary must conform to the user, an extended dialog is inefficient,and in which the error rate as well as the repeat dialing rate must bevery low. For efficient operation, a spoken word recognizer is requiredto respond to any user utterance, to distinguish between speech meant toactivate dialing and other speech and noise in factory, office, or homeenvironments, and to accommodate a changing vocabulary of diverse names.It is an object of the invention to provide an improved automaticdialing arrangement which is controlled by spoken words.

BRIEF SUMMARY OF THE INVENTION

This invention is directed to a speech responsive automatic dialingcircuit, in which templates representative of the acoustic features ofcommand words and repertory reference words are stored. A directorystores a set of dialing signals corresponding to the repertory words.Responsive to each input utterance, a speech analyzer produces a signalrepresentative of the acoustic features of the utterance. Jointlyresponsive to the stored template signals and the utterance acousticfeatures signal, a spoken word recognizer generates a signal identifyingthe input utterance. Upon the identification of an input utterance asone of the repertory words, the corresponding dialing signal isretrieved from the directory store. Responsive to the identification ofan input utterance, a set of predetermined template signals are selectedto identify the next occurring input utterance.

According to one aspect of the invention, a control signal correspondingto each utterance identifying signal is generated. Signals to addresstemplates in the template memory are produced responsive to the nextoccurring input utterance. Jointly responsive to the utteranceidentifying control signal and the memory addressing signals, apredetermined set of template signals are applied to the spoken wordrecognizer.

According to another aspect of the invention, the template signals forall templates in the template memory are sequentially producedresponsive to the next occurring utterance. A set of gating signals arestored for each utterance identifying control signal. The gating signaladdressed by the control signal and each template signal is applied tothe spoken word recognizer to gate only the predetermined templatesignals to the word recognizer.

According to yet another aspect of the invention, the speech analyzer isenabled for a preselected time interval responsive to each repertoryword utterance identifying signal. A predetermined set of preselectedtime interval template signals are applied to the spoken word recognizerto identify the next occurring utterance. The dialing signal for therepertory word utterance identifying signal is outputted uponidentification of the next occurring utterance as other than one of thepredetermined set of the preselected time interval template signals.

According to yet another aspect of the invention, a first signal isgenerated responsive to each command word utterance identifying signal.Jointly responsive to the identification of the next occurring utteranceas other than one of the predetermined template signals selected by thecommand word utterance identifying signal, said command word utterancepredetermined template signals are selected to identify the utteranceimmediately succeeding said next occurring utterance.

According to yet another aspect of the invention, a set of isolatedcommand word and repertory word template signals are produced responsiveto a cued sequence of utterances by a speaker. The isolated wordtemplate signals are inserted in the template signal memory whereby thespoken word dialing is sensitive only to isolated utterances of thesereference words.

In an embodiment illustrative of the invention, isolated word templatesignals for a set of command words and a set of repertory words aregenerated responsive to dialer cued utterances by a predetermined user.The dialer is then switched to a rest mode in which the spoken wordrecognizer is operative to recognize the next occurring utterance of theuser as one of the first predetermined set of isolated command wordtemplates, "off-hook," "train" or "directory." Detection of utterancesof other speakers or conversational utterances of the user are effectiveto reset the dialer to recognize the succeeding utterance as one of thefirst predetermined set of template signals.

Recognition of an isolated user utterance as the command template"off-hook" switches the dialer to its dial mode and causes the wordrecognizer to recognize the next occurring utterance as only a repertoryword template. The repertory word templates consist of repertory namesor repertory number sequences. If "stop" is detected, the dialer isreset to recognize the succeeding utterance as one of the command words"off-hook," "train," or "directory." When a repertory name isrecognized, the voice response circuit feeds back the name recognized tothe user and enables the speech analyzer to receive a short preselectedtime interval utterance (e.g. 2 seconds). The speech recognizer is setto compare the short utterance to only templates for the short durationcommand words "stop," and "error".

Detection of other than the two short interval command words "error" or"stop" causes a dialing signal corresonding to the repertory name to beretrieved from a directory store. The dialing signal is then supplied tothe user telephone line. Recognition of the short utterance as "error"resets the dialer to recognize the succeeding utterance as one of therepertory word templates. Detection of the shortened utterance as "stop"returns the dialer to its rest mode. After the dialing signal isobtained, the dialer is switched to its call state and the recognizer isconditioned to detect an input utterance as the isolated word template"hang up." Other utterances or "hang up" in conversational speech by theuser resets the dialer to compare the succeeding utterance to theisolated "hang up" command template. Recognition of the isolatedutterance "hang up" by the user terminates the call state and resets thedialer to its rest state.

DESCRIPTION OF THE DRAWING

FIG. 1 depicts a general block diagram of a spoken word dialing systemillustrative of the invention;

FIG. 2 shows a block diagram of a controller circuit useful in thedialing system of FIG. 1;

FIG. 3 shows a template control logic circuit used in the dialing systemof FIG. 1;

FIG. 4 shows a voice response circuit useful in the dialing system ofFIG. 1;

FIG. 5 shows a spoken word recognizer circuit useful in the dialingsystem of FIG. 1;

FIGS. 6 and 7 show memory arrangements used in the template controllogic circuit of FIG. 3;

FIGS. 8 and 9 show a flow chart illustrative of the operation of thecircuit of FIG. 1;

FIGS. 10-13 show waveforms illustrating the operation of the circuit ofFIG. 1; and

FIG. 14 shows the arrangement of FIGS. 8 and 9.

DETAILED DESCRIPTION

FIG. 1 depicts a general block diagram of a repertory dialerillustrative of the invention. In FIG. 1, speech analyzer 110 isoperative to receive signals from microphone 101 and to generate a setof acoustic feature signals responsive to each utterance obtained fromthe microphone. As is well known in the art, speech analyzer 110 may beadapted to produce various types of utterance feature signals such asformant, spectral, or prediction analysis feature signals. Wordrecognizer 130 is jointly responsive to the acoustic feature signalsfrom speech analyzer 110 and the sequence of acoustic features forreference words stored in template memory 120 to identify the inpututterance.

Control unit 150 is connected to analyzer 110, word recognizer 130, andtemplate logic 140. The controller is adapted to control the recognitionprocess in accordance with a set of permanently stored instruction codeswhich instruction codes direct the various phases of the repertorydialer operation, Speech analyzer 110 and control 150 are connected tovoice response circuit 180 so that the succession of identifiedutterances may be stored and retrieved from storage to produce speechwaves via loud speaker unit 190. Controller 150 is also connected todirectory store 170 which contains the dialing codes corresponding tothe stored template signals in template memory 120 and is furtherconnected to dialer 160 which dialer is adapted to provide dialingsignal sequences corresponding to the codes from directory store 170.

Speech analyzer 110 is operative to receive electrical signals frommicrophone 101 and to transform said signals into utterance featuresignals. In the circuit of FIG. 1, linear prediction analysis ispreferred but spectral or other analysis arrangements may be used.

Template memory 120 contains acoustic feature templates corresponding toreference utterances previously applied by a user to speech analyzer110. The reference utterances include a set of command words used tochange the operating modes of the circuit of FIG. 1, a set of spokenname templates corresponding to other subscriber telephone stations towhich access is desired and a set of spoken number templates for use inthe construction of automatic dialing codes. The command word setincludes templates for words such as "off-hook," "directory," "modify,""stop," "error," and "hang up." The acoustic feature templates in memory120 are placed in sequential positions which are accessed responsive tosignals from recognizer 130. While the templates may be placed in anyorder, it is preferable to place the set of command words in the firstpositions of memory 120. The spoken number template positions arelocated after the command word positions and the spoken name templatepositions follow the spoken number positions. The aforementioned orderpermits the addition of spoken name templates to memory 120 withoutalteration of the positions of previously recorded templates.

When not in use, the circuit of FIG. 1 is placed in its rest mode duringwhich speech analyzer is adapted to accept utterances from microphone101. Feature signals generated from the accepted utterances are comparedto a restricted set of template signals from memory 120. The flowdiagram of FIGS. 8 and 9 illustrate the general operation of the circuitof FIG. 1. As illustrated in box 801 of FIG. 8, the restricted set oftemplate signals in the rest mode consists of templates for the commandwords "off-hook," "train," and "directory." Word recognizer 130 isoperative to sequentially compare the template acoustic feature signalsfor the words of the restricted group to the acoustic features of theinput utterance from analyzer 110. As a result of the comparisonsbetween the input utterance and the selected set of templates, a codedidentification signal WC is generated in recognizer 130 which signalidentifies the input utterance as the closest corresponding template orindicates that none of the selected templates correspond to the inpututterance acoustic features.

In the event the template "off-hook" is identified as the inpututterance in the rest mode, controller 150 is switched to its dialingmode (box 830 of FIG. 8) during which a dialing sequence is permitted.Identification of the template "directory" switches controller 150 toits directory mode (box 950 of FIG. 9) during which additional names canbe inserted into the dialing repertoire, modifications can be made toexisting entries of the repertoire, or deletions can be made to thedialing repertoire. Recognition of the utterance as the template "train"causes controller 150 to assume its training mode state (box 850). Thedialing arrangement of FIG. 1 then generates prescribed template signalsfrom the cued utterances of a user so that it is adapted to recognizeisolated word utterances of the user. Where other than these threetemplates is identified in recognizer 130 in the rest mode, thecontroller remains in its rest mode to process recognition of additionalutterances as one of the predetermined set "off-hook," "directory," or"train."

The dialing mode is initiated by a WC signal from word recognizer 130which corresponds to the spoken word "off-hook." In the initial phase ofthe dialing mode, speech analyzer 110 receives input utterance signalsfrom microphone 101 and supplies feature signals for the utterances toword recognizer 130. Template control logic 140 is set by controller 150to restrict the templates accessed from memory 120 to repertory numberand repertory name templates as illustrated in box 811 of FIG. 8. Uponrecognition of the utterance as a spoken digit, template control logic140 is altered so that the next group of utterances is compared only tospoken digit templates from memory 120. Upon recognition of the lastexpected digit utterance, controller 120 signal voice response circuit180 to generate the phrase "I have recognized the number" followed bythe digit utterances just recognized. The resulting audio signal is sentto the user so that he may verify the recognized digits.

After the number is fed back to the user via voice response circuit 180,speech analyzer 110 is conditioned to receive a shortened utterance in alimited verification interval (box 823) which utterance may be "stop,""error" or other than these words. Control logic 140 signals wordrecognizer 130 to compare any input utterance during the prescribedverification period to the command words "stop," and "error."

In the event that the template for the command word "stop" is recognizedby word recognizer 130, the "stop" WC identification code returned tocontroller 150 causes controller 150 to reenter the rest mode.Recognition of the command word "error" causes controller 150 to revertto the initial phase of the dialing mode so that the erroneouslyrecognized number may be repeated. In the absence of recognition ofeither command word in the verification interval, the WC signal returnedfrom recognizer 130 causes controller 150 to transfer the previouslyreceived number code to Directory Store 170 to address the correspondingdialing code (box 817). The dialing code corresponding to the identifiednumber is then supplied to dialer pulse former circuit 160 viacontroller 150.

If a spoken name is identified by word recognizer 130 responsive to aninput utterance during the initial phase of the dialing mode, the WCidentifying code for the recognized name is supplied to controller 150.Controller 150 sends an emit signal EM to voice response circuit 180together with a U_(n) utterance selection signal. Voice response circuit180 is responsive to these signals to emit an audio signal "I haverecognized the name" followed by the name corresponding toidentification code WC. The information needed to generate the audiosignal for the WC identified name was previously stored in voiceresponse circuit 180 during a preceding training or directory mode.

Upon termination of the audio signal formation, controller 150 isresponsive to the VRD ending signal from circuit 180 to enable speechanalyzer 110 to receive a short duration utterance for verification ofthe recognized name. Template control logic 140 is then set to compareany utterance received during the prescribed verification period withthe templates for the words "stop" and "error" (box 814). If recognizer130 identifies an utterance in the verification period as "stop",controller 150 is placed in its rest mode. The recognition of anutterance in the verification period as "error" places controller 150 inthe initial phase of the dialing mode so that the name may be repeated.In the absence of recognition of the utterance in the verificationperiod as "stop" or "error", a verification code is transferred tocontroller 150. Directory store 170 is then addressed by signal DA toretrieve the dialing code corresponding to the identification code WCfor the verified name (box 817). The dialing code is then transferred todialer pulse former 160 which produces the necessary dialing signals forthe telephone instrument.

Upon successful completion of the dialing operation, controller 150 isplaced in a call state (box 819) in which template control logic 140 isset to restrict comparison of utterance feature signals from analyzer110 to the template "hang up". Upon recognition of "hang up" uttered bythe user, the call is terminated and controller 150 is reset to its restmode.

The training mode (box 850) is initiated by the detection of theutterance "train" while the circuit of FIG. 1 is adapted to receive therestricted utterance set consisting of "off-hook", "directory," and"train" in its rest state. Controller 150 is then switched to itstraining state in which a sequence of template signals are generated foruse in future dialing or directory modes. Alternatively, the trainingmode can be started upon initial turn on of the dialer circuit. Voiceresponse circuit 180 is used to direct the user to utter the words forwhich templates are required. These words include a set of commands andthe digits 0 through 9.

The training operation may, for example, generate templates for thecommand word "off-hook." At the beginning of this training sequence,voice response circuit 180 emits the prestored phrase "At the tone, saythe word" responsive to utterance selection signals U_(n) fromcontroller 150. Upon completion of the phrase synthesis, the VRD signalfrom circuit 180 is sent to controller 150. The utterance selectionsignal is then changed so that voice response circuit 180 emits thedesired command word "off-hook." The utterance selection is againchanged responsive to the VRD signal and a tone is emitted by circuit180. The next VRD signal from circuit 180 causes controller 159 toactivate speech analyzer 110 so that the feature signals for the nextutterance corresponding to "off-hook" can be processed.

Upon completion of the utterance analysis signal FI from analyzer 110alters the state of controller 150 so that the template for theutterance is produced by recognizer 130 and stored in template memory120. When the template storage is completed, the WRD signal fromrecognizer 130 causes controller 150 to initiate the training cycle forthe next word. Usually only a single set of templates is needed foraccurate recognition of each utterance. The arrangement of FIG. 1controller 150 may be adapted to provide more than one template for eachutterance. In that event, the training for the command word "off-hook"is repeated. The training mode continues until at least one template isgenerated for each command word and each digit. At the end of thetraining mode the template count is stored in controller 150 and thecontroller is reset to its reset state.

The directory mode is started upon the recognition of the command word"directory" while the circuit of FIG. 1 is in its rest state. In thedirectory mode the circuit of FIG. 1 is adapted to generate templatesfor names added to the repertoire of the dialer, to delete entries inthe repertory dialer, or to modify the telephone number information fora specified name. Upon initiation of the directory mode, speech analyzer110 is conditioned to generate feature signals corresponding to the nextutterance applied to microphone 101.

Recognizer 130 is adapted to detect whether the utterance from analyzer110 corresponds to one of the restricted set "add," "delete," "modify,""stop" or none of these command words as shown in box 952 of FIG. 9.Where the utterance is identified as other than one of these commandwords, controller 150 is reset to the start of directory mode so that itmay receive further utterances. If the utterance is identified as thecommand word "stop," controller 150 is switched to its rest mode. Theidentification of an input utterance as "add" changes the state ofcontroller 150 so that the circuit of FIG. 1 can receive additional nameentries. Templates to be added to the repertoire as generated and storedin template memory 120 and corresponding directory dialing codes areplaced in directory store 170. Recognition of an utterance as "delete,"permits the user to erase templates and directory dialing codes for aparticular name entry. Utterance identification as "modify" permits theuser to change the directory dialing codes in store 170 for a particularname entry in template memory 120.

Assume that a user desires to add a name and associated dialinginformation to the repertoire of the circuit of FIG. 1. Upon recognitionof an utterance as "add," controller 150 sends signals to voice responsecircuit 180 so that the phrase "Speak the name to be added" is generatedin circuit 180 and emitted from speaker 190. Speech analyzer 110 is thenconditioned to produce feature signals for the next utterance receivedby microphone 101 and to transmit to voice response response circuit 180coded speech pattern SP representative of the utterance. This speechpattern is stored in the next free name store position of circuit 180.After generation of the template for the name utterance by recognizer130, the template is stored in memory 120.

After the template for the added name is entered into template memory120 (box 955 in FIG. 9), the state of controller 150 is modified so thatvoice response circuit 180 emits the phrase "Enter number for" followedby the added name speech signal retrieved from storage in the voiceresponse circuit. At this time, the restricted vocabulary is changed tothe set of digits for which templates exist in memory 120 (box 957). Thetelephone number for the added name is then uttered by the user andfeature signals for the uttered digits are applied to recognizer 130.The digit sequence is identified in recognizer 130 after comparison withthe digit templates from memory 120. The identified digits are thenverified (box 958) through the use of voice response circuit 180. Afterverification, dialing codes for the identified digits are stored indirectory store 170. Finally the maximum template number TMAX incontroller 150 is modified to reflect the added name. After the updatingof the maximum template number, controller 150 is placed in its restmode.

Deletion or modification of a repertory dialer entry is accomplished byinitial recognition of the name for the entry to be modified (box 954)or deleted (box 961), verification of the recognized name (boxes 956 and963) and deletion (970) or modification (boxes 957, 958, 970) asrequired. The deletion and modification operations utilize voiceresponse circuit 180 to provide cueing signals to the user as describedwith respect to the add operation. Speech analyzer 110 and recognizer130 are used to identify the utterances of the user.

Controller 150, shown in greater detail in FIG. 2, comprises controlprocessor 210, control memory 201, and input-output interface circuit212. Interface 212 communicates with the other apparatus of FIG. 1 viabus 250 and control modules 221, 223, 225, 227, 229, 231, 233 and 235.Signals from analyzer 110, recognizer 130, voice response circuit 180,template memory 120 and directory store 170 are supplied to processor210 via the control modules, bus 250 and interface 212 as is well knownin the art. Jointly responsive to the permanently stored control signalsin memory 210 and the signals from interface 211, control processor 210provides control and data signals to the control modules via interface212 and bus 250.

Controller 150 may comprise an arrangement of microprocessor integratedcircuit modules such as described in the Microproducts Hardware SystemsReference published by Data General Corporation, Westboro, Mass. andcopyrighted by Data General Corporation, 1979. Other microprocessorsystems well known in the art may also be used. In FIG. 2, controlprocessor 210 and input-output interface 212 may be the micro Nova MP100system processing unit. Control memory 201 may comprise the MP/100 4K/8Kdynamic random access memory and the MP/100 8K programmable read onlymemory. Each of control modules 221, 223, 225, 227, 229, 231, 233, and235 may comprise the model 4222 digital I/O interface described in theaforementioned publication. The operation sequence of controller 150 isdetermined by the permanently stored instructions of the read onlymemory portion of control memory 201. These instructions are listed inFortran language form in Appendix A.

Directory control module 221 is connected between bus 250 and directorystore 170. Responsive to directory store control signals generated inprocessor 210, module 221 provides directory store address signals DAand directory store reading and writing signals DRW to store 170 andtransfer the directory codes DC and directory store done signal DSD fromstore 170 to processor 210 via interface 212. Dialer control module 223receives processed dialing information from processor 210 via interface212 and transfers said dialing information as signal DC to dialing pulseformer 160 under control of memory 201. Mode control module 227 isadapted to store the current mode code signal CM obtained from processor210 and to apply said mode signal to template control logic 140 wherebytemplate selection for each controller state is determined.

Voice response control module 229 receives utterance number U_(n)signals, and record and emit signals RE and EM from processor 210. Voiceresponse done signals VRD are received from voice response circuit 180.Module 229 is adapted to control the selection of utterances in voiceresponse circuit 180 for user cueing and verification. Record templatecontrol module 231 connected between bus 250 and template memory 140 isoperative to direct the operation of template memory. Control signalsRTN and TRW from module 231 cause the templates generated in wordrecognizer 130 to be recorded in memory 140. Transfer control module 233is connected between word recognizer 130 and bus 250. Module 233receives control pulse SA from processor 210 which signal is used toinitiate the recognition operations of word recognizer 130 and receivesthe identification and recognition done signals WC and WRD fromrecognizer 130 which signals are transferred to processor 210 viainterface 212. Manual control module 235 is adapted to permit control bythe installer or user independent of speech signals.

When the repertory dialer is installed and power is turned on, thecontroller and all other portions shown in FIG. 1 are reset by signalRST at time t₁, as shown in waveform 1001 of FIG. 10. Following initialreset, processor 210 is rendered operative to generate a templateinspection command signal TIN responsive to the permanently storedinstruction set in memory 201. Signal TIN (waveform 1003) is supplied toword recognizer 130 between times t₂ and t₃ in FIG. 10 via interface212, bus 250 and transfer control module 233. In recognizer 130 shown ingreater detail in FIG. 5, signal TIN conditions word recognitionprocessor 503 to sequentially receive the templates stored in templatememory 120 as addressed by template counter 507. Word recognitionprocessor may comprise any of the well known recognition arrangementssuch as disclosed in U.S. Pat. No. 4,181,821 issued to Frank C. Pirz andLawrence R. Rabiner on Jan. 1, 1980 or in the article "MinimumPrediction Residual Applied to Speech Recognition" by F. Itakura IEEETransactions on Acoustics, Speech, and Signal Processing. Vol. ASSP 23,pp. 67- 72, February 1975.

The templates in memory 120 are transferred into processor 503 viatemplate feature latch 505. Processor 503 is operative to detect thepresence of templates for the command words and numbers used in thesystem and to return a system trained signal SYT to processor 210 viatransfer control module 233 in the event that valid templates are foundfor command and number words. Signal SYT is shown as waveform 1005between times t₃ and t₄ in FIG. 10. In the absence of an SYT signal,processor 210 is placed in its train mode. The templates for the commandwords and numbers used in the system are then generated responsive tocurd isolated utterances by the user. The train mode may also be startedby closing switch 239 connected to manual control module 235.

Upon initialization of the train mode at a time t₄, control processor210 generates and transfers a CM=TRA command signal (waveform 1007 inFIG. 10) to mode control module 227, a record length RL=L signal(waveform 1009) to record length module 225, as well as a predeterminedutterance number signal U_(n) (waveform 1011) and an emit signal EM(waveform 1013) to voice response control module 229. The utterancenumber signal U_(n) addresses spoken message and number store 430 invoice response circuit 180 shown in FIG. 4 at time t₄. Store 430comprises a read only memory in which predetermined spoken message codesand spoken digit codes are permanently stored. The output of comparator440 enables store 430 if the utterance number code is less than constantK1. Responsive to signal U_(n) the prestored message corresponding toaddress signal U_(n) is transferred from store 430 to speech synthesizer410.

Speech synthesizer 410 is operative as is well known in the art toproduce an audio signal corresponding to the U_(n) message. This audiosignal is then supplied to speaker 190 to cue the user. The firstmessage "At the tone, say the word" is supplied to the user. Upontermination of the message generation at time t₅ in FIG. 10, a VRDsignal (waveform 1015) is returned to processor 210 via voice responsecontrol module 229. Processor 210 under control of memory 201 thensupplies the next U_(n) code (waveform 1011) to memory 430 between timest₆ and t₇ in FIG. 10 so that the first command word "off-hook" isemitted from speaker 190. Upon return of the VRD signal at time t₇ tocontrol module 229, the U_(n) address signal is changed to provide atone from voice response circuit 180.

Responsive to the VRD signal from synthesizer 410 after the tone (timet₉), processor 210 enables speech analyzer 110 with an AE signal(waveform 1017 at time t₁₀). A set of utterance feature signals UF(waveform 1019) corresponding to the next utterance of the user isthereby generated. The UF utterance feature signals are then supplied toutterance feature latch 501 in FIG. 5. At the termination of the featuresignal generation (t₁₁ in FIG. 10), speech analyzer 110 produces an FIending signal (waveform 1021) which signal is transferred to processor210 via record length module 225.

Responsive to the detection of the FI signal in processor 210, wordrecognition processor 503 is enabled to produce a template correspondingto the user's utterance of the word "off-hook." The "off-hook" templateposition of template memory 120 is then addressed by record templatesignal RTN (waveform 1023) from record template control module 231 andthe template feature signals FR corresponding to the command word"off-hook" are stored between t₁₂ and t₁₃. Upon completion of thetemplate storage operation, a WRD ending signal (waveform 1025) isreturned to processor 210 from word recognition processor 503 viatransfer control module 233.

The WRD signal at time t₁₃ causes processor 210 to send signals to voiceresponse circuit 180, speech analyzer 110, word recognizer 130 andtemplate memory 120 so that a template for the next command word"directory" is stored responsive to another user utterance. Theremaining command words required by the circuit of FIG. 1 for automaticdialing are then generated in sequence by recognizer 130 responsive toutterances of the user as directed by voice response circuit 180. Thegeneration and storage of these templates are substantially similar tothat described with respect to the "off-hook" template. Subsequentlytemplates are generated for the digits 0 through 9 as described withrespect to "off-hook." Responsive to the WRD signal (waveform 1025) fromprocessor 503 upon storage of the template for the digit "9," processor210 is placed in its rest mode (time t₁₅ in FIG. 10).

At the start of the rest mode, control processor 210 generates a modecommand signal RES corresponding to the rest mode, a record lengthsignal L to permit receipt of a long utterance by speech analyzer 110,and a last template signal TMAX to designate the final template positionof template memory 140. The RES signal is transferred to mode controlmodule 227 via interface 212 and a bus 250 and is stored in module 227.The record length signal L is transferred from processor 210 to recordlength module 225 and the TMAX signal is transferred from processor 210to transfer control module 233. The waveforms of the rest mode are shownin FIG. 11. Waveform 1101 corresponds to command signal RES. Waveform1103 corresponds to the RL=L signal and waveform 1105 corresponds to theTMAX signal.

The RL=L signal is applied to speech analyzer 110 at time t₁ in FIG. 11and the analyzer is conditioned by signal AE (waveform 1107) fromcontrol module 225 to begin receiving utterances. The CM=RES commandsignal from module 227 is supplied to template control logic 140 at t₁and is effective to restrict the vocabulary of template signals in therest mode to "off-hook," "train," and "directory." The TMAX signal frommodule 233 is applied to word recognizer 130 to identify the occurrenceof the last template stored in template memory 120.

At this time, processor 210 is placed in a wait condition until thefeature signals UF (waveform 1108) of an incoming utterance areavailable at the output of speech analyzer 110 at time t₂. Responsive toany input utterance at microphone 101, analyzer 110 generates a set offeature signals corresponding thereto. The feature signals aretransferred to UF latch 501 of word recognizer 130. After completion ofthe UF feature signal transfer (t₃), analyzer 110 produces an FI controlsignal (waveform 1109) which is supplied to processor 210 via recordlength module 225 and interface 212. Upon detection of the FI controlsignal in the rest mode, processor 210 is operative to generate an SAcontrol signal (waveform 1111 at t₄). The SA control signal is suppliedto word recognizer 130 via interface 212, bus 250 and transfer controlmodule 233. The SA signal resets template counter 507 and flip-flop 510in FIG. 5. Template counter 507 is thereby set to its initial countwhich is supplied as the TN address signal (waveform 1113) at t₅ totemplate memory 120 and template logic control 140 shown in greaterdetail in FIG. 3.

Referring to FIG. 3, template control logic 140 includes comparator 310,command control read only memory (ROM) 320 and template control readonly memory (ROM) 330. Comparator 310 may be, for example, a type SN7485integrated circuit made by Texas Instruments, Incorporated. The ROMS mayeach comprise, for example, a type 74S285 circuit made by TexasInstruments, Incorporated. Command control ROM 320 is addressed bytemplate number TN from recognizer 130 and command code CM=RES fromcontrol module 227. The arrangement or map of ROM 320 is shown in FIG.6. For each state of controller 150, a single bit code is assigned tothe first nineteen templates. A stored "one" in a position of memory 320permits the template addressed in a given state of controller 150 to betransferred from template memory 120 to word recognizer 130. A "zero" ina given position of memory 320 causes a skip signal (SK) to be appliedto recognizer 130 whereby the address template is not supplied to wordrecognizer 130 in the assigned state of controller 150. In this manner,the reference vocabulary templates to which each utterance is comparedare selectively controlled.

In the rest state, the command signal from control module 227 is RESwhereby the first row of FIG. 6 is addressed for template numbers TNfrom 1 to 19. Initially, TN=1 signal is obtained from template counter507. Jointly responsive to the RES command signal from control module227 and the TN=1 signal, the SK1 output (waveform 1117) of commandcontrol ROM 320 becomes a 1 at t₅. Comparator 310 compares the templateaddress number TN with the constant code K=19 from constant codegenerator 350. Generator 350 may comprise, for example a binary switchappropriately connected to a resistor network between a constant voltagesource and ground. The switch may be set to a voltage valuecorresponding to the binary number representation of a constant. ForTN=1, the TGK output of comparator 310 is low whereby NOR-gate 342 isinhibited and NOR-gate 340 is alerted. Since the SK1 signal from ROM 320is high for TN=1 in the rest state, gate 340 is enabled and a low SKsignal is obtained from OR-gate 344. The low SK signal (waveform 1119between t₅ and t₆) is applied to word recognition processor 503 whichmaintains counter 507 in its TN=1 state. The TN address from counter 507is effective to retrieve the first template corresponding to the commandword "off-hook" from template memory 120 and to apply the retrievedtemplate feature signals to TF latch 505.

Jointly responsive to the utterance feature signals UF in latch 501 andthe template feature signals TF in latch 505, word recognition processor503 generates a signal representative of the similarity of the inpututterance to the command word "off-hook." The similarity signals isstored in processor 503 and template counter 507 is incremented to itsnext state by signal IC (waveform 1121) from processor 503. The TNaddress code is thereby changed to 2 at t₆. Responsive to the restcommand signal RES and the TN=2 address, an SKI=1 signal is obtainedfrom command control ROM 320. The SK signal from OR-gate 344 remains lowbetween t₆ and t₇ and the template for the command word "directory" isretrieved from memory 120 as the FT feature signal set.

Word recognition processor 503 is then operative, as is well known inthe art, to produce a signal representative of the similarity betweenthe utterance feature signals in latch 501 and the template featuresignals for "directory" in latch 505. After the TN=2 similarity signalis stored in processor 503, template counter 507 is incremented to itsTN=3 state by signal IC (waveform 1121) from processor 503. Responsiveto the 1 signal in the command control ROM retrieved for the rest modethird template address, the template signal for the command word "train"is retrieved from template memory 140 and placed in latch 505. After thesimilarity signal for "train" is stored in recognition processor 503,template counter 507 is again incremented by signal IC. At time t₈ thetemplate address becomes TN=4.

As shown in FIG. 6, there is a "zero" in each of template positions 4through 19 of the rest mode row. Consequently, a zero SK1 signal isobtained from ROM 320 and a high SK signal appears at the output ofOR-gate 344. Template counter 507 is immediately incremented responsiveto the high SK signal. Gate 520 prevents the TF template signals frombeing applied to processor 503, and processor 503 is inhibited fromgenerating a similarity signal. In this way, the template for thecommand word "add" is skipped. The TN=5 address signal (waveform 1113)from counter 507 in the rest mode between t₉ and t₁₀ also causes a highSK signal to appear at the output of OR-gate 344. Responsive to the highSK signal, word recognition processor 503 provides an IC pulse at timet₉ to template counter 507. The template for the command word "delete"is thereby skipped. In similar manner, the command word templates foraddresses TN=6 through TN=19 are skipped. When the TN=20 address fromtemplate counter 507 is applied to comparator 310 (t₁₂), a high TGKsignal (waveform 1115) is produced. The TGK signal disables NOR-gate 340and alerts NOR-gate 342. For the remainder of the rest mode, the outputof template control ROM 330 determines the state or OR-gate 344.

FIG. 7 shows the arrangement of read only memory 330. For each staterow, there is a single bit used for all name entries in the system. Inthe rest mode, names are excluded from the restricted vocabulary. Thereis a zero in the first row of ROM 330. Consequently, a zero SK2 signalis obtained from ROM 330 in each name position. When the TN address is20 a high SK signal appears at the output of OR-gate 344. Templatecounter 507 is immediately incremented. Gate 520 is inhibited, and thetemplate for the TN=20 address is skipped. All subsequent name templatesare also skipped. In this manner, the utterance feature signals UF arecompared to the restricted group of templates "off-hook," "directory,"and "train" in the rest mode.

When the TN address is TMAX (t₁₅), the output of comparator 515 setsflip-flip 510. Processor 503 is thereby enabled to select the mostsimilar of the selected templates in the rest mode. Identificationsignal WC (waveform 1125 between t₁₆ and t₁₇), corresponding to thetemplate found most similar to the input utterance feature signals UF,is supplied from processor 503 to transfer control module 233. Signal WCis applied therefrom to control processor 210 via bus 250 and interfacecircuit 212. In the event that none of the templates is within aprescribed degree of similarity, a WC=0 identification signal isreturned. A WRD recognition done signal (waveform 1127) is also suppliedto processor 210. Where the word uttered by the user in the rest mode is"directory," the WC identification signal corresponding thereto causesprocessor 210 to be switched to its directory mode.

The circuit of FIG. 1 is conditioned in the directory mode to add,modify or delete entries in the repertoire responsive to spokendirections of the user. In the initial portion of the directory mode,command signal DIR is generated in processor 210 and transferred to modecontrol module 227. A long record length control signal (RL=L) and aspeech analyzer enable signal (AE) are generated in processor 210 andtransferred to record length module 225. Responsive to the AE and RLsignals from module 225, speech analyzer 110 is turned on so thatutterance feature signals are produced responsive to the next utterancesignals supplied from microphone 101. The waveforms of the signals ofthe directory mode are illustrated in FIG. 12. Waveform 1201 correspondsto the command signal CM=DIR between times t₁ and t₁₄. Waveform 1203corresponds to the RL signal and waveform 1205 corresponds to the AEsignal.

When the last utterance feature signal is transferred from analyzer 110to utterance feature latch 501 at time t₂, an FI signal (waveform 1207)is sent to processor 210 from the analyzer via record length module 225,bus 250 and interface circuit 212. Processor 210 is conditioned todetect signal FI and to generate a start recognition signal SA (waveform1209) responsive thereto at time t₃. The SA signal is transmitted viatransfer control module 223 to template counter 507 and flip-flop 510.Signal SA resets flip-flop 510 so that word recognition processor 503and counter 507 are rendered operative. Signal SA also resets counter507 to its TN=1 state at time t₃ whereby the first template in templatememory 120 is addressed. Signal TN (waveform 1211) is also applied tocomparator 310 in template control logic 140 so that NOR-gate 340 isalerted.

Command signal DIR addresses the "directory" row of ROM 320 shown inFIG. 6. Since the TN=1 signal to ROM 320 addresses the "off-hook" firstcolumn, a zero SK1 signal appears at the output of ROM 320. The outputsof NOR-gate 340 and OR-gate 344 are forced high and a high SK signal(waveform 1213) is sent to processor 503 in FIG. 5. The high SK signalinhibits gate 520, prevents processor 503 from generating a similaritysignal for the TN=1 template and causes processor 503 to apply an ICsignal (waveform 1215) to increment template counter 507 to its TN=2state at time t₄. The zero in the second column of the directory row ofROM 320 prevents comparison of the utterance feature signals in latch501 with the template feature signals for the second command word"directory." Similarly, the template for the third command word "train"is skipped.

Jointly responsive to the directory command and the TN=4 address signalat time t₆, a low SK output is obtained from OR-gate 344. Recognitionprocessor 503 is enabled to generate a similarity signal responsive tothe utterance feature signals in latch 501 and the template featuresignals for the command word "add." The template feature signals for"add" are retrieved from template memory 120. The similarity signal for"add" is produced and stored in recognition processor 503 and templatecounter 507 is incremented to its TN=5 state. The one signal stored inthe fifth, sixth, and seventh columns of the addressed directory row inFIG. 6 provide permissive SK signals (waveform 1213) to processor 503for the command words "delete," "modify," and "stop" between t₇ and t₁₀.The template feature signals for these command words are successivelyretrieved from template memory 120 under control of counter 507 andsimilarity signals for these templates are generated and stored inprocessor 503.

The TN=8 through TN=19 positions of the directory row in ROM 320 containzeros. Consequently, the templates for the remaining command words"error," and "hang up," and the digits 0 through 9 are not retrievedfrom memory 120 and no similarity signals for these templates areproduced in processor 503. When the TN=20 address is obtained fromcounter 507 at time t₁₁ in FIG. 12, comparator 310 produces a highoutput whereby NOR-gate 340 is disabled and NOR-gate 342 is alerted. TheSK2 outputs of ROM 330 are then used to control the operation ofprocessor 503.

As shown in FIG. 7, a zero is obtained from ROM 330 in the directory rowwhereby all name templates in memory 120 are skipped responsive to theDIR command word applied to ROM 330. After the last name in memory 120is addressed by counter 507, the TN output therefrom is equal to theTMAX signal stored in transfer control module 233. The output ofcomparator 515 becomes high, flip-flop 510 is set so that counter 507 isinhibited. Word recognition processor is thereby conditioned to generatea WC=0 signal or a WC identification signal (waveform 1217)corresponding to the most similar template signal to the utterancefeatures in latch 501 at time t₁₃. The identification signal is appliedto processor 210 along with a WRD ending signal (waveform 1219) fromprocessor 503 via control module 233.

In accordance with the invention the template control logic of FIG. 4 isoperative in each command word state to select a restricted group oftemplate signals to which the input utterance is compared. Thus, theutterance feature signals in the initial portion of the directory modecan be recognized only as "add," "delete," "modify," or "stop."Advantageously, the template set selection provides more accuraterecognition even with high background noise and extraneous voice signalsfrom other persons in the vicinity of microphone 101. As aforementionedwith respect to other command word states, recognition of the commandword "stop" in the initial phase of the directory mode causes processor210 to switch to the rest mode. Where other than the permitted commandwords are recognized, the WC=0 coded signal sent to processor 210 causesthe processor to return to the initial portion of the directory mode.Assume for purposes of illustration that the command word "add" isrecognized. Detection of the WC signal for the command word "add" causesprocessor 210 to switch into its name state (NAM).

The name state of the directory, mode is initiated at time t₁₄. Thecommand signal CM=NAM (waveform 1201), the utterance address signalU_(n) (waveform 1223) for the phrase "Speak the name to be added," andan emit signal EM (waveform 1225) are generated in processor 210. TheNAM command signal is stored in mode control module 227 and the U_(n)and EM signals are placed in voice response control module 229.Responsive to the U_(n) signal, the coded speech signal sequence for thephrase "Speak the name to be added" is supplied from read only memory430 to speech synthesizer 410. Responsive to control signal EM,synthesizer 410 generates the speech signal corresponding to the codedphrase from memory 430 and transmits the signal to speaker 190. SignalVRD (waveform 1227) indicating the completion of the message is returnedto voice response control module 229 at time t₁₅. Signal VRD is suppliedtherefrom to control processor 210 via interface 212. Assume forpurposes of illustration that the name "Smith" is to be added to therepertoire.

Detection of the VRD signal in processor 210 at time t₁₆ causes theprocessor to generate an utterance identification signal U_(n)corresponding to the next free name entry in spoken name store 420, arecord signal REC (waveform 1229), and an AE analyzer enable signal. TheU_(n) and REC signals are placed in control 229 while the AE signal isstored in record length module 225. The AE signal is supplied to speechanalyzer 110 from module 225 so that analyzer 110 receives the nextutterance signals from microphone 101. Coded speech signals SP as wellas utterance feature signals UF are now generated in analyzer 110. Thecoded speech signals SP (waveform 1231) are supplied to synthesizer 410via gate 450. After processing in the synthesizer, the coded signals forthe name "Smith" are placed in the U_(n) designated address of spokenname store 420. The UF feature signals from analyzer 110 are supplied toutterance feature latch 501. Upon completion of the utterance featuregeneration, signal FI is produced in analyzer 110 at time t₁₇. Signal FIis transferred to central processor 210 via control module 225.

At time t₁₈, processor 210 generates an SA signal (waveform 1209). TheSA signal is operative to enable word recognition processor 503, arecord template address signal RTN (waveform 1233) to select the TMAX+1address in template memory 120, and a template write signal TRW(waveform 1235). The TRW signal causes the template generated for thename "Smith" to be recorded in template memory 120. Recognitionprocessor 503 produces a template corresponding to "Smith" from theutterance feature signals in latch 501. The resulting template issupplied to memory 120 via line 516. The RTN and TRW signals from recordtemplate control module 231 permits the template for the name "Smith" tobe stored in the next available address (TMAX+1) in memory 120. The WRDsignal is generated after storage of the "Smith" template.

After the "Smith" template is stored, processor 210 is switched to itsnumber state and is rendered operative to generate a U_(n) utterancenumber signal corresponding to the message "Enter the number for" and anemit signal which signals are supplied to ROM 430 and synthesizer 410,respectively. These signals as well as a CM=NUM command signal aregenerated at time t₁₉ in FIG. 12. Synthesizer 410 then generates theaddressed message and returns a VRD signal to processor 210 at time t₂₀.Responsive to the VRD signal, processor 210 generates an utterancenumber U_(n) corresponding to the message "Smith" and this message isproduced by speaker 190 responsive to the emit signal EM. After the nextVRD signal is returned to processor 210 at time t₂₁, analyzer 110 isturned on by the AE signal from processor 210. The next utterance atmicrophone 101 can then be converted into feature signals.

The user then utters a string of numbers, e.g. 7-2-1-1-5-2-4 whichcorresponds to the telephone directory number for "Smith." After theutterance feature signals are generated in analyzer 110, the FI signaltherefrom causes processor 210 to generate an SA signal at time t₂₄. Thesignal is transferred to the word recognizer of FIG. 5 via transfercontrol module 233. As previously described, the SA signal resetstemplate counter 507 to its TN=1 state and switches flip-flop 510 to itsreset state. Word recognition processor 503 is enabled to generate asignal representative of the similarity of the first digit of numberstring utterance and the selected templates in the number phase of thedirectory mode. The recognition of the number sequence is generallyindicated in the time interval between t₂₄ and t₃₁ in FIG. 12.

The CM=NUM signal (waveform 1201) from mode control module 227 addressesthe number row of command control memory 320 and the TN=1 address signalfrom counter 507 addresses the first column of memory 320. The zero SK1output of memory 320 enables gate 340 so that a high SK signal isobtained from OR-gate 344. The high SK signal is applied to processor503 in FIG. 5 which in turn produces an IC signal to increment counter507. Similarity signal generation is also inhibited responsive to thehigh SK signal during the TN=1 address. In similar fashion, thetemplates for the TN=2 through TN=6 command word template addresses areskipped responsive to the zero entries in the number row of memory 320.

When counter 507 is incremented to its TN=7 state, a low SK signal isobtained from OR-gate 344 and the template signals for the command word"stop" are retrieved from template memory 120. A similarity signal isthen produced in word recognition processor 503 jointly responsive tothe first digit utterance feature signals in latch 501 and the "stop"command word feature signals in latch 505.

The zeros in the TN=8 and TN=9 column for the number row in FIG. 7causes the templates for the command words "error," and "hang up" to beskipped. Responsive to the one signals in the TN=10 through TN=19positions of the number row of FIG. 7, the templates for the digits 0through 9 are successively retrieved from template memory 120.Similarity signals corresponding to these templates are produced. Aftera signal corresponding to the similarity between the first digit of theutterance number string and the stored template feature signal for theTN=19("9") template is produced and stored in processor 503, templatecounter 507 is incremented to its TN=20 state. The TGK output ofcomparator 310 now alerts NOR-gate 342 and the zero code in the numberrow of template control ROM 330 results in a high SK signal from OR-gate344. The SK signal remains high for template addresses TN=20 throughTN=TMAX whereby the name templates in template memory 120 are skippedand no similarity signals are produced for the name templates.

After the TN=TMAX template address is obtained from counter 507,comparator 515 provides an output which resets flip-flop 510. The oneoutput of flip-flop 510 then causes word recognition processor 503 toproduce a template identification signal for the best matched templateof the group including command word "stop" and the digit templates. Inthe event that none of the template similarity signals attain apredetermined threshold, a zero identification signal is generated inprocessor 503. Detection of the WC=0 signal resets processor 210 to thebeginning of the number phase of the directory mode in which the numberinquiry message is repeated. Detection of a WC=7 ("stop") signal bycontrol processor 210 resets the processor to the initial phase of therest mode.

Assume that the identification signal WC=17 corresponding to recognitionof the digit "7" is returned to processor 210 from word recognitionprocessor 503 along with the WRD recognition completion signal.Processor 210 then conditions the circuit of FIG. 1 to advance to thesecond utterance portion identification phase in which the secondutterance portion is recognized as one of the templates from therestricted group consisting of the command word template for "stop" andthe digit templates. The utterance portions of the digit stringutterance are successively identified as described with respect to thefirst utterance portion and the WC identification signals for therecognized number string is stored in control memory 201. The detectionof the WRD recognition completion signal for the seventh recognizedutterance portion by processor 210 switches the circuit of FIG. 1 to itsverification phase in which the recognized number string is fed back tothe user from the coded speech signals in memory 420 via speechsynthesizer 410. A short period (e.g. 2 seconds) for user verificationis then initiated during which the user may utter the command words"error" or "stop."

At the beginning of the verification phase of the directory mode control(t₃₁ in FIG. 12), processor 210 generates an AE signal to enable speechanalyzer 110, a record length RL=S signal (waveform 1203) to provide theshort user response interval and a CM=VER signal (waveform 1201) tocontrol the vocabulary selection for the verification period. Analyzer110 is turned on for a short period and feature signals are generatedtherein for any utterance applied to microphone 101. The FI signal(waveform 1205) from analyzer 110 indicating the completion of thefeature signal generation is supplied to processor 210 via record lengthmodule 225 at time t₃₂. Responsive to the FI signal, processor 210produces an SA start signal (waveform 1209) for word recognizer 130 attime t₃₃.

The verify row of command control memory 320 contains "one" signals inonly the TN=7 "stop" and the TN=8 "error" columns and there is a "zero"signal in the verify row of template control memory 330. Consequentlylow SK signals are obtained for only the TN=7 and TN=8 addressessupplied by template counter 507. For each of these template addresses,the utterance feature signals in latch 501 are compared to the templatefeature signals supplied to latch 505 from template memory 120 in wordrecognition processor 503. The two similarity signals are stored inprocessor 503. The templates for the remaining command words, digits,and names in memory 120 are skipped. The recognition operation isgenerally indicated between times t₃₃ and t₃₅ in FIG. 12.

After the TMAX name template is addressed, the output of comparator 515causes flip-flop 510 to set. Processor 503 is then enabled to generate aWC identification signal (waveform 1217) corresponding to either themost similar template or to no template being within a prescribed degreeof similarity at time t₃₅. If the "error" template is identified anddetected in processor 210, the circuit of FIG. 1 is reset to the numberphase of the directory mode and the user is directed to supply a newutterance for the number string. Detection of the "stop" template inprocessor 210 results in the circuit of FIG. 1 being reset to its restmode. Where the utterance in the verification interval is not identifiedas either "error" or "stop," processor 210 detects the WC=0 signal andthe circuit of FIG. 1 is advanced to the enter phase of the directorymode.

In the enter phase (beginning at t₃₆ in FIG. 12), the codes for thedigits of the identified number in control memory 201 are transferred todirectory control module 221 under control of processor 210 along withan address code DA (waveform 1241) corresponding to the name "Smith." Adirectory write signal DRW (waveform 1243) is also produced by controlprocessor 210 and placed in control module 221. The directory codeinformation is then transferred to directory store 170 as addressed bythe DA signal in control module 221. The transfer is completed at timet₃₇. Between times t₃₇ and t₃₈, a DSD directory storage complete signal(waveform 1245) is supplied to processor 210 via directory module 221.The circuit of FIG. 1 is then reset to its rest state.

The user may alter the telephone number information for a name entry inthe directory repertoire in the directory mode. The modification stateof the directory mode is entered upon recognition of the command word"modify" in the directory mode initial phase. Assume for purposes ofillustration that the user wishes to modify the directory number for thepreviously entered name "Jones." The telephone number previously enteredfor "Jones" was 724-3211. The new number for "Jones" is 724-3522. Upondetection of the identification signal for "modify" by control processor210 in the directory mode initial phase, processor 210 is operative togenerate an AE speech analyzer enable signal, a CM=NAM name commandsignal, and an RL=L record length signal. These signals are transferredto control modules 225 and 227. The AE and L signals are supplied tospeech analyzer 110 which is thereby enabled to receive an utterancesignal from microphone 101.

The user utters the name "Jones" and analyzer 110 produces the sequenceof utterance feature signals corresponding thereto. These utterancefeature signals are transferred to utterance feature latch 501 in FIG.5. Upon the termination of the utterance feature generation, controlsignal FI is sent to processor 210 from analyzer 110 via control module225. Processor 210 detects the FI signal and produces an SA startrecognition signal. The SA signal is supplied to template counter 507and flip-flop 510 in FIG. 5 via transfer control module 233.

As previously described, counter 507 is reset to its TN=1 state.Processor 503 is enabled to produce similarity signals responsive to theutterance feature signals in latch 501 and the restricted vocabularytemplate feature signals selected by the template control logic of FIG.3. The restricted vocabulary consists of the repertory names stored intemplate memory 120 and the command word "stop." The NAM signal fromcontrol module 227 is applied to read only memories 320 and 330 so thatthe name row of these memories is selected. Responsive to the TN=1through TN=6 address signals from counter 507, a succession of high SKsignals is supplied to processor 503 from OR-gate 344. Responsive to thehigh SK signals, counter 507 is successively incremented while processor503 is inhibited from producing similarity signals.

When counter 507 reaches its TN=7 state, the one signal in the TN=7column of the name row of memory 320 causes a low SK signal to beapplied to processor 503. The template feature signals for the commandword "stop" are then transferred from template memory 120 to templatefeature latch 505. Word recognition processor 503 is enabled by the lowSK signal to produce a similarity signal for the "stop" template. Thissimilarity signal is stored in processor 503 and counter 507 isincremented to its TN=8 state.

Counter 507 sequences through its TN=8 through TN=19 states. In each ofthese states, a "zero" signal is obtained from ROM 320 and thecorresponding template feature signals in template memory 120 areskipped. When counter 507 is placed in its TN=20 state, a "one" signalis obtained from template control ROM 330. A low SK signal is producedand processor 503 is enabled. A similarity signal for the first nametemplate feature signals in template memory 120 is thereby generated andstored in processor 503. In like manner, a one signal is obtained foreach name TN address in memory 330 and similarity signals are producedand stored for each name entry in memory 120.

After the similarity signal for the TN=TMAX address is stored inrecognition processor 503, comparator 515 sets flip-flop 510. Processor503 is thereby enabled to produce the WC identification signal andtemplate counter 507 is inhibited. The WC and WRD signals from processor503 are then applied to control processor 210 via transfer controlmodule 233. Processor 210 detects the WC identification signal. Wherethe WC identification signal corresponds to "stop," the circuit of FIG.1 is reset to its rest state. Detection of a WC=0 signal, representativeof the lack of similarity between the utterance and any name, causesprocessor 210 to signal voice response unit 180 to utter the phrase"Repeat the name" and the name state is reentered.

The detection of a name in the directory repertoire causes processor 210to generate an utterance number signal corresponding to the spokenmessage "Please verify." Voice response unit circuit 180 provides thespeech signal corresponding thereto and returns a VRD voice responsedone signal to processor 210. The processor then provides an utterancenumber signal for to the name recognized. Upon completion of theutterance "Jones" by speaker 190, processor 210 is switched to itsverify state during which the user may utter "error," "stop" or otherthan these utterances. Detection of other than these utterances(indicated by a WC=0 signal), causes processor 210 to signal voiceresponse circuit 180 to utter the phrase "Please enter new phonenumber." After the spoken message is completed, the circuit of FIG. 1 isswitched to its number state. The user may then utter the new number forthe name "Jones."

During the number state, utterance feature signals are generated for theutterance sequence received by analyzer 110. These digit utterancefeature signals are successively compared to the restricted vocabulary.The number state restricted vocabulary is shown in the NUM rows of FIG.6 and FIG. 7 and is selected under control of template control logic140. The number state operations for name modification are substantiallysimilar to those described with respect to the addition of a new nameand telephone number to the repertory of the automatic dialer. Theutterance feature signals for each successive digit spoken by the useris compared to the digit template feature signals of the TN=10 throughTN=19. A similarity signal is produced for each comparison and anidentification signal (WC) corresponding to the most similar template istransferred from word recognition processor 503 to control memory 201via control processor 210.

Upon transfer of the identification signal for the last digit utterance,processor 210 signals voice response circuit 180 to provide the message"The new phone number is" followed by the sequence of digitscorresponding to the identification codes just stored in control memory201. Upon receipt of a VRD voice response circuit done signal, processor210 is switched to its verification state. The user can then prevententry of the recognized digit sequence by speaking the command word"error" or reset the circuit of FIG. 1 to its rest state by speaking thecommand word "stop." If neither of these command words are spoken,processor 210 is advanced to the enter phase of the modify operation inwhich the location of directory store 170 corresponding to "Jones" ischanged to the newly recognized digit sequence. Detection of the DSDdirectory store done signal by processor 210 at the end of the directorystorage operation resets the circuit of FIG. 1 to its rest mode.

Upon recognition of the command word "delete" in the initial state ofthe directory mode, processor 210 is switched to its name state. Afterthe name to be deleted is verified, the addressed section of store 170corresponding to the recognized name is erased. The template featuresignals for the recognized name in template memory 120 are removed.Control processor 210 generates a set of signals which are used toinsert an artificial template in place of the deleted signal. Theaddress of the deleted template is stored so that the next insertion ofa name template in the address of template memory 120 will use thedeleted name address.

After templates for command words, digits and names have been producedand stored in template memory 120 in train and directory modes, thecircuit of FIG. 1 may be activated by user utterances to providerepertory and automatic dialing. As aforementioned, the dialer of FIG. 1is normally in its rest mode prepared to recognize an input utterance asone of the command words "off-hook," "directory," and "train." In orderto dial the telephone of a person in the repertoire of the dialerarrangement, the user first utters the command word "off-hook." Aspreviously described, recognition of "off-hook" by the circuit of FIG. 1switches the dialer to its dial mode in which dialing responsive to aname utterance by the user is done.

At the beginning of the dial mode (time t₁ in FIG. 13), processor 210generates a CM=DIA command signal (waveform 1301), an utterance numbercode U_(n) corresponding to a cueing tone and an EM emit signal. TheU_(n) and EM signals (waveform 1303) are transferred via voice responsecontrol module 229 to message store 430 and speech synthesizer 410,respectively. The tone signal is emitted from speaker 190, and the VRDvoice response unit done signal (waveform 1305) from synthesizer 410 isdetected by processor 212 at time t₂. The processor sends an AE signal(waveform 1307) and an RL=L signal waveform (1309) to analyzer 110 attime t₃. The analyzer generates utterance feature signals for the nextutterance received by microphone 101. When the utterance feature signalsare transferred to latch 501, analyzer 110 produces an FI signal(waveform 1311). The FI signal is detected by processor 210 by t₅. Theprocessor then generates an SA recognition start signal (waveform 1313)which is supplied to template counter 507 and flip-flop 510.

Counter 507 is reset to its TN=1 state. Processor 503 is enabled byflip-flop 510 to generate and store a sequence of similarity signalsresponsive to the utterance feature signals from latch 501 and theselected template feature signals from memory 120. The templateselection is restricted in the initial phase of the dial mode responsiveto the CM=DIA dial command code (waveform 1301) from mode control module227. The columns of the dial rows of memories 320 and 330 aresuccessively accessed by the TN address codes from counter 507 and theCM=DIA code from module 227. The bit pattern of the dial rows assuresthat only the TN=7 "stop" template, the TN=10 through TN=19 digittemplates and the name templates from TN=20 to TN=TMAX are retrievedfrom template memory 120 for comparison with the utterance featuresignals in latch 501. All other templates in memory 120 are skippedresponsive to a high SK signal (waveform 1317) from OR-gate 344.

The similarity signals for the selected templates in the dial state arestored in processor 503 and the most similar template is determinedafter the TN=TMAX template is processed. Assume, for purposes ofillustration, that the user utters the name "Smith" and that thetemplate for this name is selected as the most similar template by wordrecognition processor 503. The WC identification signal (waveform 1319)corresponding to "Smith" is returned to processor 210 along with a WRDrecognition done signal (waveform 1321) at time t₉ via transfer controlmodule 233. Upon detection of a name identification signal (t₁₀),processor 210 is switched to its verify state. Signals generated by theprocessor cause voice response circuit 180 to utter the phrases "I haverecognized the name" followed by "Smith." After these messages areemitted by speaker 190 between times t₁₀ and t₁₃. VRD signals (waveform1305) are then returned to processor 210. Analyzer 110 is therebyenabled for a short interval responsive to the RL=S and AE signalsproduced by processor 210 at time t₁₄. The user may then utter "error,"or "stop." Any other utterance or silence produces a WC=0 verificationsignal.

In the verification state, the utterance feature signals of the shortverify interval are compared to the restricted vocabulary defined by theTN=7 and TN=8 entries in command control ROM 320. As shown in FIG. 6 forthe VER row, only the "stop" and "error" templates are retrieved frommemory 120 during the verify mode for comparison with the contents oflatch 501. Between t₁₆ and t₁₉, the sequence of TN address signals(waveform 1315) are produced by counter 507. Only the TN=7 and TN=8address signals between times t₁₇ and t₁₈ are not skipped. Where theutterance "error" is recognized, the circuit of FIG. 1 is reset to thebeginning of the dial mode after the message "Repeat the name" isproduced by voice response circuit 180. The verify state WCidentification signal and WRD recognition done signals occur betweentimes t₁₉ and t₂₀ in FIG. 13. Recognition of "stop" resets the circuitof FIG. 1 to its rest state. Otherwise the circuit of FIG. 1 is advancedto its call state so that the dialing codes for the recognized name maybe outpulsed from dialer pulse former 160.

In the call state, a directory address code corresponding to therecognized name identification signal WC (waveform 1319) between t₁₉ andt₂₀ is produced in processor 210. An RL=L signal and an AE signal aregenerated in processor 210 at time t₂₀. The directory store address (DA)and directory read (DRW) signals are applied to directory store 170 viadirectory control module 221 so that the directory number for "Smith" isretrieved from the directory store. The retrieved directory code isplaced in directory control module 221 and responsive to the directorystore done signal processor 210 transfers the directory code to dialpulse former 160 from dialer control module 223. Pulse former 160 isoperative to generate the dialing signals corresponding to the name"Smith" and to provide the dialing signals to the telephone set 195.

The circuit of FIG. 1 is then conditioned to recognize the command word"hang up" responsive to the CM=CALL command word applied to commandcontrol memory 320. Utterances are continually analyzed during the callstate as indicated in FIG. 13 between times t₂₀ and t₂₆. In the callstate, a low SK signal is obtained only for the "hang up" TN=9 templateas shown in the call row of FIG. 6. The SK signal is low only at t₂₂ andt₂₅ in waveform 1317 of FIG. 13. When the WC identification signal for"hang up" is detected by processor 210 at time t₂₆, the circuit of FIG.1 is reset to its rest mode.

If a digit rather than a name is recognized while the circuit of FIG. 1is in its dial state, processor 210 is switched to its number state inwhich the CM=NUM command signal restricts the template selection to thedigit templates TN=10 through TN=19 and the "stop" template TN=7. Thesequence of digit utterances is recognized as described with respect tothe directory mode and the WC identification signals for the recognizeddigits are stored in control memory 201.

The detection of the WRD recognition done signal for the last expecteddigit utterance in processor 210 causes the processor to generate anemit (EM) signal and a succession of utterance number U_(n) signals. Theutterance number signals correspond to the messages "I have recognizedthe number," the recognized digit sequence, "Please verify," and a tone.These phrases are generated by synthesizer 410 responsive to the spokenmessage coded signals in memory 430. Responsive to the fourth messagecomplete (VRD) signal from synthesizer 410, processor 210 is switchedinto its verify state. Analyzer 110 is enabled by the AE and RL=Ssignals from processor 210 for short interval. The user utterance duringthis interval is converted into feature signals in analyzer 210. Afterthe utterance feature signals are stored in latch 501, an FI featuregeneration complete signal is sent to processor 210.

Processor 210 detects the FI signal and generates a CM=VER commandsignal and an SA recognition start signal. These signals are stored incontrol modules 227 and 233, respectively. The SA signal resets counter507 and enables word recognition processor 503 via flip-flop 510. The TNaddress signals from counter 507 are applied to comparator 310 and tocommand control module 320. The CM=VER signal is supplied to commandcontrol module 320 and template control module 330. As previouslydescribed, the template retrieval from template memory 120 is restrictedto "stop" and "error." Any other utterance or silence is effective as averification of the recognized sequence. Where the similarity signalobtained for the "stop" template feature signals is better than thesimilarity signal for "error" and within a predetermined range, the WCidentification signal corresponding thereto is supplied to processor210. Processor 210 is reset to its rest mode. Return of a WCidentification for "error," switches processor 210 back to the initialphase of the dial mode after a message to repeat the number is generatedin voice response circuit 180.

User verification of the recognized number results in a WC=0 signalbeing detected by processor 210. The processor is then conditioned toconvert the stored recognized digit codes into directory codes which aresupplied to dialer pulse former 160 via control module 223. The dialingsignals obtained from pulse former 160 are transmitted to telephoneinstrument 195 and the telephone call is initiated. Upon completion ofthe telephone connection, processor 210 is placed in its call mode andconditioned to detect the WC identification signal for "hang up." Allutterances are converted into feature signals. Recognition processor 503is conditioned to supply a WC signal corresponding to "hang up" to theprocessor when the prescribed degree of similarity is obtained. Only thetemplate feature signals for "hang up" are retrieved responsive to thecommand word signal CM=CALL supplied to command control memory 320.

While the invention has been shown and described with reference toparticular embodiments thereof, it is understood that variousmodifications may be made by those skilled in the art without departingfrom the spirit and scope of the invention. ##SPC1##

We claim:
 1. A spoken word controlled repertory dialing circuitcomprising:a memory for storing a plurality of template signalsrepresentative of the acoustic features of reference words includingcommand words and repertory words; a directory store for storing a setof dialing signals corresponding to said repertory words; a speechanalyzer responsive to each input spoken word utterance for formingacoustic feature signals representative of said utterance; a spoken wordrecognizer jointly responsive to said input spoken word utteranceacoustic feature signals from said speech analyzer and template signalsfrom said memory for generating a signal identifying said inpututterance; means responsive to the input utterance identifying signalcorresponding to a repertory word template for retrieving thecorresponding dialing signal from said directory store; and meansresponsive to each occurring input utterance identifying signal forselecting a predetermined set of template signals to identify the nextoccurring input utterance.
 2. A spoken word controlled repertory dialingcircuit according to claim 1 further comprising means jointly responsiveto a command word utterance identifying signal and the next occurringutterance identifying signal corresponding to other than one of thepredetermined set of template signals selected for said command wordutterance identification signal for reselecting said predetermined setof template signals selected by said command word utterance identifyingsignal to identify the utterance immediately succeeding said nextoccurring utterance.
 3. A spoken word controlled repertory dialingcircuit according to claim 2wherein said template selecting meansfurther comprises means responsive to a repertory word utteranceidentifying signal for enabling said speech analyzer to receiveutterance signals for a preselected time interval; and means responsiveto said repertory word identifying signal for gating a predetermined setof said preselected time interval word template signals to said spokenword recognizer to generate an identifying signal for said preselectedinterval utterance; and said dialing signal retrieving means comprisesmeans responsive to said preselected interval utterance identifyingsignal corresponding to other than one of said set of preselected timeinterval word template signals for retrieving the dialing signalcorresponding to said repertory word identifying signal from saiddirectory store.
 4. A spoken word controlled dialing circuit accordingto claim 3wherein said speech analyzer is responsive to a sequence ofutterances of isolated words by a speaker for generating a set ofisolated word template signals including command word and repertory wordtemplate signals, and said spoken word recognizer comprises means forinserting the isolated word template signals into said template memoryas said template signals whereby the spoken word dialing circuit issensitive only to isolated utterances of said reference words.
 5. Aspoken word controlled repertory dialing circuit according to claims 1,2, 3, or 4wherein said template signal selecting means comprises: meansresponsive to each first occurring utterance identifying signal fromsaid spoken word recognizer for generating a control signalcorresponding thereto; means responsive to the generation of theacoustic feature signals occurring immediately after each of said firstoccurring utterance identifying signals for producing signals to addressthe template signals in said template memory; and means jointlyresponsive to said utterance identifying control signals and saidtemplate memory addressing signals for applying only a predetermined setof template signals to said spoken word recognizer for identifying saidacoustic feature signals.
 6. A spoken word controlled repertory dialingcircuit according to claim 5 wherein said means to address the templatesignals in said template memory comprises means responsive to thegeneration of the next occurring utterance acoustic feature signals forsequentially generating said template memory addressing signals for alltemplate signals in said memory;said predetermined template signalapplying means comprises means for storing a set of gating signals foreach utterance identification control signal, means jointly responsiveto said control signal and each template addressing signal forretrieving a selected gating signal from said gating signal storingmeans, and means responsive to said selected gating signals fortransferring said predetermined template signals from said templatememory to said spoken word recognizer.
 7. In a spoken work controlledrepertory dialing circuit comprising a memory for storing a plurality oftemplate signals representative of the acoustic features of referencewords including command words and repertory words, a directory store forstoring a plurality of dialing signals corresponding to said repertorywords, a speech analyzer, and a spoken word recognizer;the method ofoutputting dialing signals responsive to input utterances comprising thesteps of generating a set of acoustic feature signals for each inpututterance in said speech analyzer; identifying each input utterance as areference word in said spoken word recognizer jointly responsive totemplate signals from said memory and said input utterance acousticfeature signals from said speech analyzer; retrieving dialing signalsfrom said directory store responsive to each repertory word utteranceidentifying signal; and responsive to each occurring input utteranceidentifying signal, selecting only a predetermined set of templatesignals for identifying the next occurring input utterance.
 8. In aspoken word controlled repertory dialing circuit comprising a memory forstoring a plurality of template signals representative of referencewords including command words and repertory words, a directory store forstoring a plurality of dialing signals corresponding to said repertorywords, a speech analyzer, and a spoken word recognizer; the method ofoutputting dialing signals responsive to input utterances according toclaim 7 wherein said template signal selecting step further comprisesgenerating a first signal responsive to each command word utteranceidentification signal; and, jointly responsive to said first signal andthe next occurring utterance identifying signal corresponding to otherthan one of the predetermined set of template signals selected by saidcommand word utterance identifying signal, reselecting the predeterminedset of template signals selected by said command word utteranceidentifying signal to identify the utterance immediately succeeding saidnext occurring utterance.
 9. In a spoken word controlled repertorydialing circuit comprising a memory for storing a plurality of templatesignals representative of the acoustic features of reference wordsincluding command words and repertory words, a directory store forstoring a plurality of dialing signals corresponding to said repertorywords, a speech analyzer, and a spoken word recognizer;the method ofoutputting dialing signals responsive to input utterances according toclaim 8 wherein said template signal selecting further comprisesenabling said speech analyzer to receive utterances for a preselectedtime interval responsive to a repertory word utterance identifyingsignal, and gating a predetermined set of preselected time intervaltemplate signals to said spoken word recognizer to identify the nextoccurring utterance, and the dialing signal retrieving further comprisesretrieving the dialing signals corresponding to the repertory wordutterance identifying signal from said directory store responsive tosaid next occurring utterance identifying signal corresponding to otherthan one of said predetermined set of preselected time interval templatesignals.
 10. In a spoken word controlled repertory dialing circuitcomprising a memory for storing a plurality of template signalsrepresentative of the acoustic feature signals of reference wordsincluding command words and repertory words, a directory store forstoring a plurality of dialing signals corresponding to said repertorywords, a speech analyzer, and a spoken word recognizer; the method ofoutputting dialing signals responsive to input utterances according toclaim 9wherein said acoustic feature signal generating step comprisesgenerating a set of isolated word template signals responsive to asequence of isolated utterances by a speaker including isolated commandword and repertory word template signals; and said input utteranceidentifying step comprises inserting the isolated word template signalsinto said template memory as said template signals whereby said spokenword dialing is sensitive only to isolated utterances of said referencewords.
 11. In a spoken word controlled repertory dialing circuitcomprising a memory for storing a plurality of template signalsrepresentative of the acoustic features of reference words includingrepertory words; a directory store for storing a plurality of dialingsignals corresponding to said repertory words, a speech analyzer, and aspoken word recognizer;the method of outputting dialing signalsresponsive to input utterances according to claim 7, 8, 9 or 10 whereinthe predetermined template signal selecting step comprises generating acontrol signal corresponding to each first occurring utteranceidentifying signal; generating signals to address template signals insaid template memory responsive to the generation of the acousticfeature signals occurring immediately after each of said first occurringutterance identifying signals; and applying only said predeterminedtemplate signals to said spoken word recognizer to identify saidacoustic feature signals jointly responsive to said utteranceidentifying control signals and said template memory addressing signals.12. In a spoken word controlled repertory dialing circuit comprising amemory for storing a plurality of template signals representative of theacoustic features of reference words including repertory words, adirectory store for storing a plurality of dialing signals correspondingto said repertory words, a speech analyzer, and a spoken wordrecognizer;the method of outputting dialing signals responsive to inpututterances according to claim 11 wherein said template addressing signalgenerating step comprises sequentially generating template addressingsignals for all template signals in said template memory; saidpredetermined template signal applying step comprises storing a set ofgating signals for each control signal; and jointly responsive to eachutterance identifying control signal and each template addressingsignal, applying the gating signal corresponding to the templateaddressing signal and the utterance identifying control signal to saidword recognizer to gate only the predetermined template signals to saidword recognizer to identify the next occurring input utterance.
 13. Aspoken word controlled repertory dialer comprisinga memory for storing aplurality of template signals representative of the acoustic features ofisolated reference word utterances including a set of isolated commandword utterance feature signals and a set of isolated repertory nameutterance feature signals; a directory store for storing a set ofdialing signals corresponding to said set of isolated repertory nameutterance feature signals; a speech analyzer responsive to each inpututterance for forming acoustic feature signals representative of theinput utterance; a spoken word recognizer jointly responsive to saidinput spoken word utterance feature signals from said speech analyzerand template signals from said template memory for identifying the inpututterance as one of said isolated reference word template signals; meansresponsive to the input utterance identifying signal corresponding to arepertory name for retrieving the dialing signal for said repertory namefrom said directory store; and means responsive to each occurring inpututterance identifying signal for applying a predetermined set ofisolated word template signals to said spoken word recognizer toidentify the next occurring utterance.
 14. A spoken word controlledrepertory dialer according to claim 13 further comprising meansresponsive to a repertory name utterance identification signal from saidspoken word recognizer for enabling said speech analyzer to receive aninput utterance for a preselected interval; and said template signalselecting means comprises means for gating a set of preselected intervalisolated word template signals to said spoken word recognizer toidentify said preselected interval input utterance, and said dialingsignal retrieval means comprises means responsive to said preselectedinterval input utterance identifying signal corresponding to other thanone of said preselected time interval isolated word template signals forretrieving the dialing signal corresponding to said repertory nameidentifying signal from said directory store.
 15. A spoken wordcontrolled repertory dialer according to claim 14 further comprisingmeans jointly responsive to a command word utterance identificationsignal and the next occurring utterance identifying signal correspondingto other than one of the predetermined set of isolated template signalsselected for said command word utterance identification signal forreselecting said predetermined set of isolated word template signalsselected for said command word utterance identification signal toidentify the input utterance immediately succeeding said next occurringinput utterance.
 16. A spoken word controlled repertory dialer accordingto claims 13, 14, or 15 whereinsaid predetermined template signalapplying means includes means responsive to each utteranceidentification signal for generating a template group selection signalcorreponding thereto; means for storing a set of gating signals for eachtemplate group selection signal; means responsive to the next occurringinput utterance feature signals for producing a sequence of signals toaddress all template signals in said template memory; means jointlyresponsive to said utterance corresponding template group selectionsignal and said template memory address signals for retrieving aselected gating signal from said gating signal storing means; and meansresponsive to said selected gating signals for transferring only saidpredetermined set of isolated word template signals from said templatememory to said spoken word recognizer. .Iadd.17. A voice responsiverepertory dialing circuit comprising: a memory for storing a pluralityof template signals representative of the acoustic features of referencewords including repertory words; a directory store for storing a set ofdialing signals corresponding to said repertory words; a speech analyzerresponsive to each input spoken word utterance for forming acousticfeature signals representative of said utterance; a spoken wordrecognizer jointly responsive to said input spoken word utteranceacoustic feature signals from said speech analyzer and template signalsfrom said memory for generating a signal identifying said inpututterance; means responsive to the input utterance identifying signalcorresponding to a repertory word template for retrieving thecorresponding dialing signal from said directory store; and meansresponsive to each occurring input utterance identifying signal forselecting at least one template signal to identify the next occurringinput utterance..Iaddend. .Iadd.18. A voice responsive repertory dialingcircuit comprising:a memory for storing a plurality of template signalsrepresentative of the acoustic features of reference words includingrepertory words; a directory store for storing a set of dialing signalscorresponding to said repertory words; a speech analyzer responsive toeach input spoken word utterance for forming acoustic feature signalsrepresentative of said utterance; a spoken word recognizer jointlyresponsive to said input spoken word utterance acoustic feature signalsfrom said speech analyzer and template signals from said memory forgenerating a signal identifying said input utterance; means responsiveto the input utterance identifying signal corresponding to a repertoryword template for retrieving the corresponding dialing signal from saiddirectory store; and means responsive to other than repertory wordtemplate input utterance identifying signals for selecting at least onetemplate signal to identify the next occurring input utterance..Iaddend.